<html>

<body class="help">

<center>

<h1>DatasetExplorer</h1>

<p>
<b>DatasetExplorer</b> is an application built on the <b>Galatee</b> library for exploring (browsing and searching) <b>collections of annotated images</b>. DatasetExplorer and the Galatee library are designed for exploring machine learning datasets (for content-based image annotation and retrieval).
</p>

<p>
<a href="http://njames.trevize.net/wiki/projects:DatasetExplorer">http://njames.trevize.net/wiki/projects:DatasetExplorer</a>
</p>

<p>
<a href="http://njames.trevize.net/wiki/projects:Galatee">http://njames.trevize.net/wiki/projects:Galatee</a>
</p>

<p>
copyright &copy; 2010 Nicolas James
</p>

</center>

<p>
DatasetExplorer also includes the <b>Jmagine</b> library, which uses the SVG specification to display bounding boxes or named polygons (as available in the LabelMe or PascalVOC datasets).
</p>

<p>
An image collection can be represented by:

<ul>

<li><b>a directory (possibly with sub-directories)</b>.</li>

<li><b>a TAR archive:</b> as used, for instance, in the ImageNet image database.<br/>
The TAR archive is not unpacked: the Galatee library uses Apache Commons VFS to read data directly from the TAR file.</li>

<li><b>a text file that contains file paths to images:</b> the file can contain relative paths, in which case you have to specify a path prefix.<br/>
Relative paths are very useful if your dataset is on an external hard drive, or if you work on several machines and the dataset is not stored at the same location on each of them.<br/>
An important feature of this dataset format is that <b>you can add textual annotations in the file</b>. Here is an example of such a file: it lists the first two images that populate the concept 042_Castle in LSCOM annotation v1.0. Each image is annotated with the other LSCOM concepts of which it is an instance (for this example, only a few concepts are shown):<br/>
<blockquote>
TRECVID2005_145/shot145_102_RKF.jpg,140_Steeple,107_Standing,101_Urban_Park,224_Outdoor
<br/>
TRECVID2005_148/shot148_491_RKF.jpg,153_Landscape,309_Free_Standing_Structures,442_Sidewalks,235_Vegetation
</blockquote></li>

<li><b>a text file that contains URIs to images:</b> the files are downloaded into a temporary directory. As above, you can specify a URI prefix, so the file can contain relative paths.<br/>
You can also add textual annotations in the text file, using the same format as above.</li>

<li><b>a location that contains an instance of an IIDF model</b>.</li>

</ul>

</p>
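<p>
To illustrate the idea of reading images from a TAR archive without unpacking it, here is a minimal Python sketch. It is not the Galatee implementation (which relies on Apache Commons VFS); it uses Python's standard tarfile module instead, and the function names and the list of image extensions are assumptions made for this example.
</p>

```python
import tarfile


def list_tar_images(tar_path, extensions=(".jpg", ".jpeg", ".png")):
    """Return the names of image members without extracting the archive.

    The extension list is an assumption chosen for this example.
    """
    with tarfile.open(tar_path, "r") as archive:
        return [
            member.name
            for member in archive.getmembers()
            if member.isfile() and member.name.lower().endswith(extensions)
        ]


def read_tar_member(tar_path, member_name):
    """Return the raw bytes of one member, read directly from the archive."""
    with tarfile.open(tar_path, "r") as archive:
        handle = archive.extractfile(member_name)
        if handle is None:
            # extractfile returns None for members that are not regular files
            raise ValueError("not a regular file: " + member_name)
        return handle.read()
```

<p>
The bytes returned by read_tar_member can be handed to any image decoder, so nothing is ever written to disk.
</p>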

<br/>
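<p>
The annotated text file format described above can be read with a few lines of code. The following Python sketch is only an illustration, not the parser used by Galatee: it assumes that fields are separated by commas (as in the example), that paths contain no commas, and that the prefix is a POSIX-style path. The function name and the prefix /media/datasets are hypothetical.
</p>

```python
import posixpath


def parse_annotated_list(lines, prefix=""):
    """Parse lines of the form 'path,concept1,concept2,...'.

    Returns a list of (path, concepts) tuples. The optional prefix is
    joined to each relative path (assumption: POSIX-style paths).
    """
    entries = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        path, *concepts = line.split(",")
        if prefix:
            path = posixpath.join(prefix, path)
        entries.append((path, concepts))
    return entries


sample = [
    "TRECVID2005_145/shot145_102_RKF.jpg,140_Steeple,107_Standing,101_Urban_Park,224_Outdoor",
    "TRECVID2005_148/shot148_491_RKF.jpg,153_Landscape,309_Free_Standing_Structures,442_Sidewalks,235_Vegetation",
]

# The prefix below is a hypothetical mount point for the dataset.
for path, concepts in parse_annotated_list(sample, prefix="/media/datasets"):
    print(path, concepts)
```

<p>
A line without annotations simply yields an empty list of concepts, so plain path lists and annotated lists can be handled by the same function.
</p>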

<hr/>

<h1>Usage</h1>

<p>
If you are running Galatee or DatasetExplorer on a GNU/Linux or Unix platform:

<ul>

<li>if ImageMagick is already installed, right-clicking an image opens a contextual menu
that contains <b>ImageInfo</b>, which displays information about the image.</li>

<li>if the OpenCV library is already installed, you can also install some Galatee plugins (see the project
website), for example for face detection or face and eye detection. You can easily add new plugins.</li>

</ul>

</p>

<p>
Run a search in the <i>Search</i> tab, then click the previous and next buttons to explore the results.
</p>

<h1>Keyboard shortcuts</h1>

<ul>
<li><b>+ and -</b>: add or remove a column in the browser.</li>
<li><b>*</b>: show / hide the textual description.</li>
<li><b>CTRL-p</b>: display the image preferences dialog (width / height of images, width of the textual description area).</li>
<li><b>CTRL-f</b>: display the search panel.</li>
<li><b>F11</b>: fullscreen mode.</li>
</ul>

</body>

</html>
